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Strycture-g uMect Analysis Reveals Nine Sequence 
Motifs OonsiTOd among DNA. Amino-mtlNy I- 
transferases, and Suggests a Catalytic Mechanism 
for these Enzymes 

Thomas f$al0aie\ Robert W> Biummtm** and Xi&odoog Ctiertg 1 * 



Previous Xrny erystalb^xaphic studies have roveated d:a*t the mmhtk 
domain of a DNA w^crs^faw (Mtsse) generating CSonedrykytosme 
hms a sir&isxg stricture] similarity to dxal of a Mts$?: generating 
N§-med)yiad^nh^. Guided by thl% eooonon stvmm% vye pedor^xid a 
multlpte sequence alignment of 4;> anaoo-Mtases (N8>ademr^ end 
N4<ytosinep lbs compax&ox: revsai&ti nine conserved moti& mnmpond^ 
log to the motifs ! to VIII and X previously denned in CS^eytosIne fcs*; 
The amino and CS-cytosine Mtases thus appeer in be mor* closely rsted 
than has been appreciated. The amino Mtases couki he divided km d-ree 
groups, base::! on -be ^xpieutud order of motifs, end this varl£tk;n in order 
may axphex- why only (wo roolife were previously eecogxfe->x| in the mmm 
Mtase><. The Mtases grouped in this way $bow several other group epeedfc 
pmperUea, indudlxig differences in aniino arid sequence, nSoleeular mass 
end DNA sequence specificity Surprisingly the K^<:yo>ah^ aud 
^-•adenine hitases m not form separate groups. Th&sa xesults have 
initiations for the malyde meehaohims, evohjtioo axid d!vx:raidearkxn of 
this family of enzymes. Furthermore, a corapaxadve analyst of the 
S adenoyyR, unetbinnme and adenine Aytose u> binding pockets suggests 
that structurally and .tuncUo:oalh; tixey are remarkably similar to one 
another, 

& Aradeini;: LU**«! 

Keywords: mo\m acid miueiicz motif; catalytic eteehaniere; DNA 
nHa:hyhraosiexase; S-adenosyi-L^nednon!ne: structural similarity 
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■UNA Mtase* transfer xnedna groups horn 
S ademsvia,:a^ri^i)riinc (AduMed tusoaeifie pos- 
itions Oxidases in dmdae- branded DMA., The DNA 
Mt«s fall into two major cesses, dolined by the 
position medryfated. The members of one daas 
tmilwlm a pwimidine ring ca&bm v&idlm* 
Clhnk^tetosine (SmC; e g. HM Mrase. Mifedh 
Momhers of the second, class methylaie exocyrfk 
amino nitrogens, foxxoire? eidw '-^-^kit^yt^jbnim 
CNSmA; e,g> M/laqx) or N4^^hyfcyteit^ (K-lxnC; 



Ahbx«?v1attnn$ ami AdoMet, S^idt?no$yk> 
rnedxinna^:: Adorlcy SadeoesyiA ■hee^^ystane; 

N4?)C, BUi^k^m; N$mA f m oxedxyledeeine; 
arwiwc* Mmm mmmm. N4m€ or NtSrnA: 

QfMi^ x^eerho! ()-e^thya:rare;-ten^c; CM. ronsecved 



e.g. M.Fvnh). Mraees of tlxe twv> classy wote 
exoected to be subsikoti&iiv difikeixt froxix one 
another, hased on i.he imk m^m of 

rnetfiyi bramfer are vsry different This substnstt? 
diference mn bt? iilustrated by the ^spectivc 
o -charge dexBxtxea for the rn^tiwW^ 
gexr.d^s. which are -eOJ^ on the exoeyclie mrnm 
gxonps of both adenine and eyiosine, and - 0.03^00 
the ^carbon of cytokine (Reuti^op^dakrishnan eizL 
1S7I), Do Mtam from the two dks&s, io (m, differ 
subiaanUaily from -dm m&Msf? 

Arialysls of gene sequences one suggested that 
die two Mtase classes are xixhte dldersot. All 
bacteria! 5xnC Mtases, and a €Mi>m vkw> SmC 
Mtase, contain a: set of ten conserved fcfcks of 
mim acid residues (1 trirou^ix X: Posfai ei dL ISM: 
Cheng ® a/, : 1983a; Kumar efeL !S94;Lanater etah. 
:iSS9: Som e? -ah, 198?}. These conserved motifs 
have the same linear orfe, which mri0f\m their 
identllkaiion in pxioi^ey $ouuex^:te§. The^e ten core 
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ssemd.m^' <m mn pm&ni in the carhoxw 
tmmml ~ amino add mtOms of ttmik. 
homarn and Av^tmnm CoG SmC Mtaaes (Beamr 
at aA. ill Scheldt et eA HAM; Fhmepan & Deooxa. 
hB3; Gumhner al, 18921 In z®mm< 
sligjBrnaia an*** amine acid seau&oess of the am&x;. 
Mto ft* not jx>wated >Md\ mmvvmm (see 

'Mm.mf. Dm Mtasea \mi been dmrmmmm 
strraluoxUy two modi's of :1mG Mimm were 
a^gasd j&nril^al rolea. Motif f ftlW cor? of which 
U dmm ^swrya a G*y*&$v eepuerm such as 
AiaH^GIv- Lo^-Gly eh m Mi&I} was pmmxwd 
to he pari m tk Aekdvlet Modis^ sit*. This 
?$$\grmm%. based on tfee pr^s^^ee of this 
Qyrich &?quer&s in a wMe v^issty 'of AdorvfoKfe- 
pendent Miasea In addidoo. so th« §ni€ M»s, 
inched N6ro& sma KfotC DMA M«« and 
RNA, iTOio, md amah mfeute Mtasea £KU> 
r6^U.te-<* a/. 1989 Ir^:rom> & ;A. i88& Smith 
at aA x&SO; Whson & Morras; m\: Kwm & Clarke, 
19M} The other mod? to which a rote cookl 'be' 
imi&ml mm nmf IV. which contain an invariant 
dfpepdtfe (Pro-Cy^b The Cys In tola motif is tim 
aedve site midmphih, and .farms & mtmmt 
cdvafeM hood to (to xbxwbon of the ro^ihvtehb 
oe (Sartfi <k ah. 1984; Wo & Sanit i98T; 
# $L mi: fmmnm A Amml Smith 
at a?.. S«:Ww«i^al. mi: Itekef aL 1393; 

m & 'm$m> mm £hm h ism. urn mMm 

Mases lack a Pr&Cys dx&s$3ti*te. 

Structural aoaiyak. however, has tbuoxj snikiog 
similarity helweex> DNA Mm of dm iwo classed 
lotoxxoadoo on the soxieOxrea of DNA Mtaaes drat 
cams from studies of M.Hfcal and M Jbql whkA 
life:: most froox- type II resoxcdon -oiodxth 

cation $pin^ are arrive m mmmmnc enzyxnexa 
them $mim Urn provkied irrsighia irao Mtaac 
dooe^n or^oi?zotioo and its relatiori^hio to the 
eoxmrvod ^queo<e o-ratfo (C^ceg $1 J, 1^3a; 
iahahxr H tm. The :>troerurc of an Mifel- 
ON A coniph^ provided fonder Inaight oho the 
hotcttoos of several conaerved amino acids hv,pli- 



rated In DNA mmrm mecBdm catalvsfe mti 
mMM. hmdm (CHeng et ah : 1993h: Khina^uskae 
^ *L BUh Mjm m idkied ioto two broad 
dox^alo5v^ catalytic domain that co^n$ the aeiivo 
silo and AdoMepbiodh^ regions; xmcS a DMA ■rec- 
ognition, region. The stature of completed 
who AdoMet; Is abo bilohot (hahahn er a/.. !8M). 
Tbfe bliobal struehrce may he « $&m$& property of 
DNA Mtasea. as it rm Wo b&Si seen iaOowsn^ 
iiroited proteoSveis of M.EcoEi (Retch cr af : 
and MPoeil (G M Adato^ & R. Ah BiurneMhal. 
unpixhlisliod mints). In (xmitmt,. a sir#e dnro;:dn 
sirnctixre has been deiorn^ined for eateclKa O-nKnlt- 
yjtramferase (CQMtaae; Vtdgmo ef A, 1894}. 
Catechol like cytokine, is a asxvmarofeemd ring; th:fe 
imil molecule eoo o<adtw differ toto die "active 
site of CGMtase ibr medndtra^far horn AdoMet. 

The mumstd coxnpajlson of three AcbMk^ 
P*M» rmzktim thtr^lpc toaiosof 
trie hie>ha? pcotetos MMml and M,«h a:od the 
endre doxr^ato of COMtasw ah akhibit very 

siinhar iSa'ee^dloter^ionai fokhon &idca:ki!>^c 
^ aA ISM) The recently pohl^hed strueto:^ of i:h« 
5otC Mtaae MiMl (Reioiach ct ah : i«) is also 
eoo^Atetit with thia iAldrng pmMm. This sixnihtnty 
h'Ctodea the positions of mm.no aeki :oxkvch;;oxe> 
iovoh^d in etd^er Ark^kn binding or eataxym in 
other wox<b, mmiy of tf^c conserved motife in the 
caedyde do?nain: of UMhal hm: atcoctora! ran 
o a>k# th the other t wo Mtaaes (D- Gara . #t si , imSl 
'fhia soggesta that roaoy (if riot all) AdnM^a -depen - 
dent Mtaaes may shao- a eoroeaox^ eaodyak doo:taxo. 
^troctvnao h ao> thia not ooiy ahowa atrochJreJ 
predledoos for other AdoMeo dependent Mtx^e^. 
but also provides a framework for atrenipts io 
compare dieir seqoences. Cnkkd by this coxn-noo 
cataiy tie domain atroetore, we parfcrn:^e<1a n:iidtjph: 
aequeoee aiigraraoa of 33 NemA arid 8 NimC 
Mtases, Our reaulta revea! that she N4nr€ md 
Nim\A Mtasea are more closely related to mm 
mother aod m the oxoC Mi<m& thax; wa?; a^pecied. 

Tola wnrk eoodrnrs iim the amino Mtaaes belong 
to tlxree gf;enpa diatlrtgoyxed try didhreneea in the 




Figure 1. Sequence .ahgfayssnt of 33 NfeA DIM A Misses sM a N-hnc: DNA Mtams. AcGrono «. &Gmm & €. Group 
y, Mhtifs (1 m XPare ^ahcled oshag ^rnorneoc^atura of Posfa; ei st ii$m, ami saciix^occs .«n>- <:ax:oped dote y) oaton 
the oexnenciamre of Wsto (imi Censer ved aaena ^ich are ^enxexi ^ :T. f> Q/Nj. {V; h/l MUF. T Wl (C. E 
Ah tit 18 and (S, To uaiogxaandaed <?ae- iener ^b^revixalwer xavar a;ra jjxamo adds; wuhox a exeon m sbewe as -white 
lettera apinat a black h^kgmuod. coawvsd hv.dropho.bte pomw® m indkmd by hoki \mm m a ahsded 
baekgrouod, and .conserved [xaar ae dm®ati. paatrioas hy bold Meters wahm a hiw, f ester degrees of $mmw®m are 
ahewxv ie deer^waayarder, by hokl ajx! upp^rcas^ keers, wbxie nc-ox:oeo>ex^d posksae-; are ahawn a$: iowarras-e lettera. 
A {-•) iadk^M a ddevxOrX^ m E^xi of she three groups of Mtaai?a is preceded by a theorettea'i 

topological drawlr^. i<xaaox^ko (!e»terecp exbeiite l^fe> aod s«rows (nuxnberad) depicr ^kands Ccmaeeved aexfoa 
acida foen xoatifa a e> K) lo^e circled aad thatr peskier m kacrred from the ^icneture'i cokipartsan of MJtMl md 
M.lml xSchlaekebka ;a af . l^e:0, la ackiitina. the sec?>n>:iarv stracrore-: of ahown in groab y;. are tadMed by 

cyrjoders Oxelkes) and atxov^ po aod^) <lmm dime:?xv ^bove the amine ackia fo: : tnxn$> daoo. the armoo acid ae^xeoxcS 
of MJitei i$ %xnC'hkm$ aa<i n. m m& xmkmk. <XMtem are also wvi« la e,map ? fer coaxo^isae. n Madf Kcouid 
not be taenbhed ibr MJb&m (bdkoA ex e^fnop y) n^ ^aaaenee P(bd03. but wm rea<hiv fenod io the ae^oenee uaed 
by Whaan (i aa;d, who renaraxt aa airernabw; atert poskiao that tacraaa^ the aiao of ibis Mm* Iraar m mmm aear 
mkhtm to SR H The Mmes Ml and Stal are each doable cose hitases with ;sw>a>::>xvx: jxaives (Soa^aakt at ah 1« 
Kka o- $L )W&> atKl each hdf was axxaJyiced Jxxk-pndexxdy 
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ifoe&r orders of c«md motife in thsir primary 
^donees, Together with our abservatoi that the 
AdoMet and m^Wk^k htm binding pmm 
hm> iWmmhty simikr mvmimm, this suggests 
cattle rotes tor e^al of the ain^md" kdse 
cb&ina and has imgii^ira"®- for the evetotionary 
history of tee : ari?yr». 



The structural similarity of the sKttve sid? and 
AdoM^>bhding regions of M$$$ m those of 
M,Mkt $ug&^is thet the amino Mm^ eo.mato 
hoixmom of tho conserved motifs fauad to 5mC 
Mm. The ^qusmres o£ NBmA ami N*lmG DMA 
w«re dserefora gathered and sn&Sy&sd as 
\n MMsmk add Methods, We W*>re pre^ 
mnxi for %wz mm<m, w look for tlmv nmih to 
linear orders im mm mxsn$i\w SmCMmms, First, 
others had noted that the two previously kfetjffed 
cowvd mod& to amtoaa MS&ses $$*pfia*$d h> 
dif&i$ftt orders Id th& various Mtasss (Kiima^ 
-aft***tf& !«; Wife & Mra* MU Wita. 
$$2}.. feeod. odass had shown thai the SmC 
iAwm. eouid fonetioo wah a etotoedy connoted 
motif order (J, Bttmsho. pene>o;d mmmjr\kmon}, 
or •w.feMlhsi regions vvco^ ^pr^^d:^pn^y and 
allowed ib &s*odate m vhv (Karreman % do V^ard. 

ftafei H si., 1801; Lee * ah. imhWe were 
ahfe to Identify nine segments of sequence 
.sfonbdty among the 42 smino Mms (F%u^ 1}. 
eon ^ponding to oioofe I to VM end X to *h« SmC. 
Mfcsea imtki n mm. mmM *x;Udendfv a 
homolog to motif IX of the SoC Mt&i<$; M MMh&l. 
this motif h involved \n the protein tokhng of the 

U too stew lWW end MMqL media I 
to III and X or* prtosadfy responsible, for binding 
AdoMet iCh^o m nl, iBM^b: Klitm^&km et £[ 
mM;.Umm etaL mi;MAmMmx IMS), 
erid w tertn fern, coHecdwIv the AdfeK^i^hinding 
region. The strocrorai comparison su^g^ted that. 

M Vi and VHl ^re primarily responsibfe for 
eataly^b (Schhsefebier ^ ^l, 1§S5}« as thev .form rh^ 
active site akwwith mod^ V and VIL and wo terro 

Tftrw groups of 



F%x.3xe I clearly %lwm that the N8mA.MS«*s 
tlmim- km ten? d^dnct groups., ha^ed on the orcfer 
of <x-morv^d modfo h ^ no^wo^ tdy tdat the N4mC 
m4 mm& Mimm <k not Wp^tely from 
one aoodieiv I hfe geooping compared to earher 
^oaiy^es .to the Discussk>n, 

Tlk v.^k% of the grooptng shown in Figure 1 
which h imtii mUly-.m oiottf order, is supported 
by the fci dM the Mtases within each gre>up sire 
stellar to <soe a;oath^r by > 




First, the grmrps differ to imm of type of 
0«hytoqn (Figure 2A): N&nA Mmm are idood to 
ah thr^e grcjnp^ hut group $ ind?idee e^hf of the 
oio^r N4mC fvfee$ snalymb end groo|> 7 ;hM a 
modf order wry i>itntSor to that Men in the sroxnrof 
all U m$wvice<i 3mC Mmm {dffistiw ^ Irehie 
p^ittoti of motif X: Ko^nar ^ ef , !#4}. Second, 
oornparehte motif seqoenoe^ wkhto earh group axv- 
nm^Mmihrto one ar^ofer then to da? mme mohfe 
from Mioses- in one of th« <m<cr grotips IMtxi the 
groups older to Unm of tnoieouler m^, with 
group est Mtases being small PO to 334 amino acid 
midues), gronp y hfems* be?ng lar^e o> 580 
amino eetd residue^, and group g eoverr:^ both of 
these s&e raoges (228 to Sal «rnioo aeld reskiues). 
Foiu iih, the g:x>op3 Ao differ to .terms of DhlA 
^ec|oen<:e reoo^ooi-ed. Thai is, a 4teaacrcowsemu$. 
can be derived e^h^group: vMte each .group 
$n<dude>: sper^keies; that do not :meieh the 
corisemuie and aoaie Mtase speefhottk^ Could In 
mosne than one mo^nms ; it m clear tim the r^oog- 
nginn ^perihestie^ are non^ratidoeev d^trihuted 
mm Ae iodioated In Figtr^ !. IE/ IZ 
grono :>'■ N6mA Mtases recognize 
(C/G}MN:,, 0 7IG/C} (M A/C; nnr 
oate^ the aiediyiated h^se). ii/l? pup 
reengage the ^eqi^enee (G /GN^.< 

«?n$smus sohstrate ^p^rjficei^ adow verifiable 
predictions. For oi^nphr h:-^ o\Kieo;.ide a?x|uenee of 
ihe CM 'Mtms gene has not h*on reported, hot ha 
^pecideity fATCGAt) euggeyta that it will he found 
to hebog m group ?, 

The Mtaaes' differ tn the eektiw one^r ord^r of 
three regions: the AdoMetdaodiog region, the 
eataSytk (active aite) re^jon. and the target 
roeogoition region (Figure 2b to die S^C MtaM<<i. 
the target r^oidoo regi;>o la c^poo^hfe for 
specif ic'DN A §e^t:ienca r^oogrtttjon , miiMi generally 
located within the iox)gear gap benv^n 
motifs {K3imaM5.^k^ et afo Mi & Roberts. 

Noyen VVeMi^r & IVaxnner, ^9:>). Group =x Ls 
arranged in the order %mm to rarhoxy}: A<bMe;.- 
bindiog region, target r^eo^antbn regi&K and then 
catalytic region. Group p is arched so the order: 
csedyoe c^giou.. target T^eognidon rege>n. and 
AdoM<e-binrhng regiorr Groop y is arrax^ed in the 
order ; AdoMet ibio<B;>g regkai .catalytic te|gon ( and 
Oitget teeogriition region. No Ml:o:>es were f ound to 
have H m predicted Ado-Met binding region hctvve en 
the other two regions figure 2A : arra:ogemet^:^ h 
md Q . No Mtase- te< known to have the target 
rect)gnidoo region at the amino end {B&nre 2A, 
arrangements ^ arid Orlsewfew. MA%d'"0\%nA,. 
assigned to sroop y} ; M,Cn : 81 and M.i&ahd (both 
N4roe, signed to groop M h^ve ior^ foiQO amiuo 
acMl reatdoe) ^atn^o-proKioia^ e^:{ne??res tipstreiBo 
of & ikat eoiieerveri ■m&St and in theory these 
upstream sequence* could contain the enget 
re<wetem regions lor timm tbnre Mtaeea If so. 
M.VapO cored be a^igr^d m go-up c: and M.CfrBi 
and M.B^.faHl to greop t, 
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Figure £ A> ftm&te srr^gm« sf tkis^ tmpr: t%$m i'muti in UNA fcg$as<& % im timt thx t^m^mmm 
of tH<*se sirrstii^^ IM&aisd for KStow tfm pm;» %mC, HUmA, or N4m€, 7m m^xi& t) r<?fe ?:o the 

SmC.Mwas. ha*$-«n$ dUfe^m ^c^ the group r^iw Mt« mm Xis nmx (M zkthmv mrMm m 
5mC -Mtase*; &/im mpummm^ mzntplm of $m& N$mA- a&d mmC Mire** fcv mojit IV and shw1h$the 



rotetiv* jaotfiSc«» offfe otbar ox®uryed motifs. The Io^toM*!* region, us N$fc«&3 ^ the puiatw targeu^w^^^ 
region im&iistf m xm f .&•.>}. in xmx erne. 



c 



?i«iir* 1 Si^WBJtfe^ two «/^ du^ from DKA Umm, inr^m* repr^mk^ (Cm:>m, mi). Too 
&«0^d du^l^j^j & :>->t ?>t:.vxc<-is troo; of &o prlraary #x*«K*w of MHM or U/tml $m laumifc 

tf-'&.--i9§4}.. A, .&> MiiM. The swo s/p tkmm pi • •< *A -> o:B (shown Hi greoo with Cdoop ia yd low) 
and ' :xD •»* |H> ~* 3$ b.rmvn with Pdoop in oyaxi) woco isol&tfcd k'om tho coayml^^d 

^l.HME^A-Adv>Hcy ^rootoro (A), sxsd t^tgftgj with raspec* o> m# mxilm to ad^ovs She-most wlfpi?| 
(B). Too $>sh$&& trim* ihi\ two os/p dtf*t«r£ couki ho soponroposod with Un tm.$.6. of <l A for too C*' stoms. Also 
shown arti tho pos&tes. rdostvo to too rs^poctlvo dustors. of the Adooky asterc&yi oxooty (pooon) and tho tat gs* 
cyfcastae nng $>t<mrii C, IX Fronted a$ dfe&F&fid for MAI, oxcopt thast 8m brgstdboha? ring h mi &&&& 

dnoo m *x?m.mti position has oot yot boon dotormiood. 



Comparison of oons&rvisd ms*tl?$ among the 
families 

White the osdo:e : of cp«rwd motifs varte 
a?y^i>g tim fiixm groups of Mtosoo, mmy of 
their basso festurfts- mt: t by dolioidoo/ to- 
Mm*± Thoso con«vod feabuks &rs< d^crfbfcd 
•below. 



This ftiolif be&o d&odhod It* & v&doty of 
AdoMot-dopoodont Misses feo Introdueifen), 
Stnsotars^ motif ! forms tho socosndarv sifx&am* 
Ill -ioop-xA (Flgm* t; Schojokobiot ■■<£aCi£$5>- CH. 
and lm> frequently Ala or Fro : farm the loop 
(G-ioop) tbm. hmtis tin imMmim moiety of 
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mmm. The. core orihro -loop & oiy^Qfy 

itlp^pMe. IH. is" my mnm aeldf hut xfivm group y 
Mtases in jure I recdaee thefimiilv wir:h Ala or 
Sac (M,&^SV w»w (sis oft to m group a) 
rapkee t:he Secoud G3y with or*e of seven 
alternative The rmjodtv of ammo Mmtm. have 
Pre aa the imtmnm acid of atrand pi flVo4§ in 
M.%1), asma of 13 Mfam in group a (ami 
the SmC MtarcO fete & or Leu at sfet* position 
tlmt? in Cpm*rye<3 Iwdcophobie side- 

chains In strand, p! m% required for packing 
aptost bebx The only motif ! rule w&houf 
^c&ptio&s- among the Mtaaes In i mvaiv^ 

the position 4 amino ackfe upMream of the 
Gly^Gly which is, In all esses, Asp or 'This 
Is the .^ulrtroa® |K&mon pf fM (Figure II and 
poses addutonM mmxhzmcti mmxzmu by 
^afctk^wU*i the %o?es of the psptkfe Ixonds to 

The most ptro^ motif I dif¥&&*K$ among 
the Kfeses is that both gh>u$xs « arid ft a* well a$ 
S&iC Mtasca< !« Phs at the besmnm^ of d»> 
Qdoop (second po:dtk>n amino to Civ X Civ), hot 
group y mmt hm or Cly \mm<S: m 

iwoMlmm mmmxm \ \4dmxthb pattern (MJi^RI 
Id group y and UJ&dl in group p}- in the 
mxid.\m of the 5mG Hme Mi/oaf thra Gdoop 
PhelS forms an edg^-to-te van dex Waaia (vow) 
tmttici with the '"'^to moteSy of Adoptee 
tfewfcvfer, in thg atrucluxe of M,%jl (a member of 
group y)\ the same tme^don with AdoMet is 
provkied by the B»ti€ rim iVom behx 
|$<:hitM>kii)fe: & sL 19851 It aopaaos that. In ^four? 
■v. rhaPh* ihsfil^gijis she G loop in the SmC Mtasea 
is rsplaced by a spahaUy ^iiavak^it Phe from helix 




flfe U ancf tU 

Time two moiif^ were d^rihed as k^-con- 
.^rvtxJ '.blocks- m SniC Mt<^e<- mrd y ,. IMS). In 

struciumily cii^srJerj^d Mt^s, m&ff 0 
eomaim &t&p&v®ly ch'axpd. amino aeid at the hM 
in itwui $2< krmmim With the rite 
of AtloMet, arid fdlmM by <s binky 
h^ds a q>^bte si<te.^hain th^t: ?mfe vow 
with the AdolvM adenine {G!o^T?p in MMiml 
Chm ik m M.lkifi). Of the 42 mitto MiM&> m 
Figure 1. 30 nu^tch s fGtu/A^4>-<^ssfeosm a wltero 
ti> to any bulkv hvdroohobu: ^kfe-chatn, t^utdly 
iMow^d hy Asp, Qkt t& Mo. 7\m gm*m do mi 
-differ $ubstantia!iy In tbss.. 

Motif ill abo contain a:n Asp/Gin or A^n/Gin to 
th^ fet po^idon of lAm^ ^ MMhal and Ass>89 
In M,i^b> whkh Ini^niOB dlmetiv wti;h the 
mKvdk Nli> fN§) of the AdoMe? adenine 
(Schhid^bior ef Motif ill in addition, 

provides a hydrogen bond to Nj ->f ihz AdoKlet 
adeni?^ from a pepdda haokbojie NH mmp (IleSi 
m.MiiM ^ilWSC^to MJ^I),TIie com?^mlir^ 
po^jtloo k groopspocifea^y con^ervod In the 



Mom w 

What we call motif IV of the entlno Mtesfes was 
found in -wtv ses:p;«?j!tc^ compari^om and called a 
't)PFY txxm <} bkm no tfe sequence (Hatooan 
er a^. 10^; Gbamfraee^ran & Snath, nm A b^tor 
comparison of 18 NSnxA and thn?e N4mG Mtases 
iifentihed ooiy vw&- conserved {K.b- 
ma^i;i?te zi ,*G imi r>m of which li motif f The 
other conserve m&mM was the DFFY n^otif 
which, it wm &$gm& m%hi con^pond to n^obf 
IV hi 3mC Mtaaes ; ov^n though the reason 
n^echaoisn^ appear to he ouhe dbtbrt The 
structural comparison of MMhal and MJbqi haw? 
eoote^d this eorre^ondence (Sohk?e.kohkt.r «t aL 
mSl tm diprolyl motif located in the loop 
regbn mmide the carooxyl end of jhf {ih« P-lo?^; 
Fk-un? I), The P loop forma the a<0.w> sen:, alone 
with mollis VI and Wl (aoe Dbcos*e>of The 
peptide, backbone of the imrmxmidvia P imp m 
M<im mo contributes to m AdoMet hhJdhjg site 
(Cheng, 1995a; see also Kossykh M I883f Motif 
iV has the eon^er^u$ seouer^e Asp-Pn>Fro lvr 
(DPP>'} in gruop a, DPPY for N8mA Mtases in 
.groop )t. Aso Pro-Prn- Ax (NPP¥) & wmp y, aod 
Se^Pro^Pro^Iyx (SPPY) for N4:nC Mtasea: this 
grouping pMtern for motif IV has keen noted 
previously (Wilson ■. .Murray; 1S91; WOsore 1992). 
There are eseeotsons to this partexn. inost rtotatav 
M^smKi (N4mC: in r>coui> & which has a DPPFi % 
MJIMU (mmA in group fi, OPQ¥) and MM% 
mmA tn group a. DTPY). 

mtftv 

In s>xOi.:.:> y. iootif V irnntatns the eonsenaua 
{Asn/Aspj-Le^elyr-XA-PhedLeo/val/^ As de- 
scribed <few?. kngroup y, this Phe rcpiacca the Phe 
that 'begins the G loop In the Mtasea of groups ox 
h. The Leu (LeuP-0 in MJfel. Leu 143 M M/M) 
jimkm vd>vv rnotact^ to the AdoMet adeni^i on the 
same sale as the Phe (Schhickehie? e? $L !§95).. Of 
the kltases to Figure ! . only one thai lacks Phe at the 
stare of the Gdoop fails to cox)tato it in xooufV and 
that is M JMIL which has a Ik ®. that poM. ]p 
grot^ps « and ff uoe of the cnoserved i^deophrobk 
side rhauw In motif V may have the same spatial 
position as the Leu b; group v Miasca, 

in the snuctu^]ly characterised M;aaes ; rooilfVi 
fbfms strand p (Figure 1: SeMnxd^hier ^ A, l§95f 
A conserved Giy starts the strand, and the strand 
ends wSifa Cly Pro or Aia : while in group « a 
exsda with Ser-Asn. Groups y and P also differ 
froth gxoup in havbig a coaaerw^d pattern of 
iwdrnphni)k: axnino acicte In 

Motif VI! is m>t sttPpgly: conserved ewn arnon$> 
$mC Mtaaes : , yet credible "candidate^ can he fennd 
wlthhr esioh gkiup. In M.HMl das oiodf blades 
AspI44GVr to the loon between helix a£ and mm$ 
06. and it mm away ffam the DNA blndlo^ deft 
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(Ctag $t <C mm. it : fe thus tao&v&d to & 
Involved in the hddim? of the catalytic wmm 

(Cheng, mm* 

In iM primary s^«$fle& ;»?f VII! baars hltfe 
ressmsbianee to \he motff pmsni in -SroC Mtasss 
£GInt»^ in Mdi&UA Tim pm- 

$imMy ^fleets tho feci that the 5mC Mtases 
im&sd With irytoslm v& h?d^g&r* bonds (through 
A:rgi85 fo M.ife& ? while the NlrnA Mtpes appear 
to Sn&ract wkh the target DMA adetiihg 
hydrophobe ter^^om fe th^ $tructo^ dfM^I,- 
tb<r c<rrr^po^1kig region (the loop sxzmetiikl 
stranda W* and g?) contorts- .Fht*i'96 ( which aligns "to 
a Phte: or !y? in other amino Mtasas, It Is 

lihm P\m)M makes feyt^bie edgMd-i^s 
vdw s&jifcaets to the target DNA 
adenine §Mix<Mm .4 *f* 1S9S), 

M?#rX 

The foeation of mcfeif X m the primary s^qa^ 
ia one of the major differences between the SmC 
isikS amino Mtases, In the SmC Mtases. ihii motif: 
oornos -from, the rooms y torn nous. In the amino 
Mtases -the eorresrKaxliray myxli always to ife 
amion sfcfe of motif 1; at ihs? termini of the 
protein in group a and 7. and to the middle of (he 
protein in group [1 There are pronormeed group- 
speettk: difeemes On ahe eecp-iene*i of bits rand? 
(figure 0- Hosvevsr, m M Mxmm< 8m motif & 
expected to Term a helix next to amend £f (formed 
by motif l), with eonsorved hydrophobe .sfcfe-chafos 
required px carta to posidons for packing against the 
S :Strantfe ? and a loop preceding foe helix {Figure* i). 
This- loop, along with, the 0- -loo?? of motif I arid die 
Fdtfop of motif l.V< form the skfe of the binding 
w&kzt in MMhM for the na?tbioome moietv of 
AdoMel (Cheng, i99:Afo 



groups 



Structural comparison of 



The idemiheatioo of nine eohserwd rnodfs shared 
with the $mC Mtases allows the amino Mtases* from 
each group to be mapped onto the consensus 
struct ore in a wsteoume manner {Figure !). The 
rooet peoriotrneed differeriee amods these three 
p« of amino Mtasoa ia eonnecdoo hetsveao 
the pmpo^ed AdoM^oblndh:^ and eataiytk re~ 
■gidnsi, in group y. a eoooeeUor; between hehx #C 
mxd nmrd H h:oka the mo region: M :Ml hmm^ 
to thfe. groop. M.Hh$ mti C(Mi$m afeo belong to 
gxpop y ; . hm$4 m the order of the conserved nioilk 
0gam P axrepdog «lf K to caso of die SxnC. 

Mimv mmwMd InimmMon h ftom Mirnm wrth : 
e^e;otiaih; a gr^p y nmiit order, in groups and 
% tho iv^i regions' are appareotiy enmieoted vm a 
separate dornanr, target reeognittoo ration. The 
e^alvtk and ^doMot^trair::^ r^tmns of these 
corikl no^rtheie^ fu the eonser^ua 



MJ-m-MJkfi ^tructore. Wratd^r tiaa aotodiy 
occurs ia ^nr^ntly betog axplon^h as aover^l mm 
DNA Mmm, tm uo<fegob^ crj^taUographie 
anahem ..As the gmttp u and 0 Mtasoa ara pmpeked 
to tern ibs? DNA recognition domsao between tba 
AdoMei -hhidiiig and catalytie regtooa, it Merest 
h)g that other atrucrurahy efa^rktesl^d pmzim 
have a reeogfUfioa doxoaio irra^ted between 
set^enees that idem a cmW&c ^-ahaer elnster (for 
exaon>fe, the G proteio, Cmmm M aL m& the 
KFvult eod<:aaj.c:kaae. Chenn & a/. and 
two flavopostatns S« ch0eroot tiomyim ir^erted 
hemeex: pari^ of FAD-hh ^joe domain 'iMM & 
Schxih, mt: Setaeurier et at. 



What ean the eor^eoaoa MJMl-Milkqt structure 
and the cooaervatkai of oine varateriee n^ki& ^Oiong 
the anaoo and amC Mts^ea teiO m afenot the 
poasihie catai vtic meehirnaea of tWNfesA Mtaa^s? 
m propose d>ai the answer m thb quasdno hea In: 
stmmpm&m of the b<ridb>g ehea for DNA adenine 
and .for tho adenosvl moiety of A tfoMet;. which ate 
stnkh^y simhar. While no NlhnA Mtaae DNA 
eo crystal ^rrvKtu^ has yet been demeaned, the 
conservation of strnotore and fin-etjon among 
AdoMeodepoodent enzymes sopporOeJ by both 
the simtiar atrnetuod hana^ork of the taa^lvtie 
dontain^ found in M:HMh Md%I ; and COMraae. 
and by the similar eoaformation of the bound 
AdeMk wirh the n^ethyi &roup positioned ' im 
^irpri^ngfo) rinse to the aid^tr^e (Sebiod^hter 
er at., f^SS). ThU smx^amhkmcimn eon^arvation is 
also sus'ge^t.ed hv the eonaeivatka* of amino aek:te 
from moors I, II lib V sod X >vbteH in rhe 
souctnrahy characterised Mo^es. inten;ci wkh 
AdoMet. " 

Are th:e fiNA-a^atosyl and AdoMer-ad<o*osy1 
hind a ^ site ^ atriie tnraby eon ■ parable? They are 
oaeii dominated hy eomparabie -x/& elosteis 
0\. ~* ah ^ p -xi and ' p4 ~* a:!.) ^ "jJ5 - ^E): 
the former mcfodea raotife I and U< a^l IMda rhe 
hoib of the AdoMetddndirp reborn and ti>e Mm 
ants IV a> VL and forms ti 



inebaiea namb IV -o VL and forms the hoik of the 
catalytic region. These two eiosters arai their 
botoid sobatrat^a do. in foeo have strikingly sandas 
strneturea, Tne two clusters bom: t:he Mif^I- 
DNA-S-adeno^yldAjomoeyakane (AdoHoy) atroa- 
tore can be superimposed .with a ro;:aane^a>^qtti«^ 
deviation &MsM of <I A for ths C : atoma in the 
$ aaxands, with the AdoHcy adere>^yi oa>iety 
ovex lapping the target cyioajoe vim (eigure 3A 
and Bb Similar over!aj>pi% is also pdasihfe for the 
&/$ dmmm of the M diad- A doMfer srnartnre 
Wimm 3C arid D) and of the MJ&eill -DNA 
siroetore {resuh;s not ahown). While the various 
Mi&m grmps have ttee two eimmm in 
didorent orders (Fiprtes \ and 2) , in noaa^ae haa the 
motif order ^earraiigtarr^xa: i;:aer:arot:ed an r- /g 
cfosiar (Ftgt^re 1), Tte reiatadriess of the binding 
pocket for the DNA base and m A>:k-Mar may 



explain mi mmmik® immm of the MLH*&ili 
i&rmw*- Ohmsdi si' 4. :1S8S}-: the unnaired 
3* ihymme of one "DMA duree* peoeo«s -the 
Adofet pocNf of fee neighborm* Mta«-BNA 
oomph?*. The ihymim doss not mim doepy mmgh 
to Interact "with the mmwd acidic amk*o add 
^i^ctes feee s^jfcto on rootife Tl and III), hut 
does make the ■ hyd^l^e comSc® made by the 
AdoMet adeoosyl nmm, such as lkevtodk:e 
mckxng with lyr39 or motif II (analogous to Urp41 

?#m<i on-&mth^n^l and guttural similarity of 
the DMA -adenos vS and AdcM^^ad^fxc^vl mote t*es, 
and the $iructt«*) xfma&my of .the AdoMekbtnding 
and cataSytie regbm of Hum, wa pwe 
aoamgous Mmx- d<&mm® m&mmm in ihe'tWo 
^gmm Cite 0 The tafethyfetk^ of aefenira 
sppaas* to ^serit from-ss dimi; akaek of die A doMet 
fl&tiM worn m the adenine N6 (Pooolotd et ak 
1MB; Pio e* &k !§&}•. 

In analogy to the hvdrnaeo b«a<j behyms 
AdoMee adenosyi No and a mohfB! Aso m H.Hhd 
md M;im we suggest thai the N$ amino tm^en 
of the target e^miM is m donor m a hydn>gen 
bond to the skfe ehalrs of Aep/Aan hi motif 1Y and 
pd^ifeiy -m om of the mm&<3mhi mygms. of th® 
adj^.eni two polim? reskioes. I hU: w6\M negs> 
tWy pokake N& aaSvsilafj k for direct tramfe'of 
fb&-€Hj from AdoMet. ItvMtases with Asn in this 
pardon (group y) the earboxandde eorid he the 
donor m a hydrogen hood to adenine Ni , as weh s<s 
an acceptor from adenine MS. zinnia?- to the cole 
Asn228 of • hynodvl^e synthase plays in hydrogen 
hondh^iodyMP \Lh^ &Wi ife also see Pitcure 
3 of Cm.. m4). Mxmm wkh Asp. m M* poskte 
eoukl afeo hydrogen bond adenine Hi If the 

Consistent with the above, tantetioo of DPFY to 
GPPY or APPY In the two helves of die bifooetkeia! 



Mi^e M.mi (group m ^xmm aedvitv in the 
altered half Bu^ak; eiM.< l«h Alierb^ DPPY to 
SPPY or NFPY mmlmm im «ebvh:y of the m^uo a 
NSoiA Mtese MMmDm (G^yot ^ <d., 1993}. The 
M,EeaD*£m restih is sotnewhat surprising.; m NFPY 
& in motif IV jo mm? of 12 Mteees of group y 
(Figure 1), whik SPPY is in nmtx IV m mxMtk&> 
of .gfpu|3 p a-Tid even one (M.Mvai) of s>roup ^ (eeo 
the following mtmx mi NAM: Mimsl SMfek 
al^rhig MPPF to DPPP m "M'JotKI led in km of 
^dv% (WiHeock et ^ 1984), $UmW & a type I 
NOinA Mtase dtOt :ha^ also been roodeied onto the 
comen^ns stracmre (Dry«fe?i, et at, 

mS) m interpret these d^ta from: nmtam: enzymes 
to mean thai ikz mmM} pomk>m of. ehe .mivmify 
hydrogen bond accentor/ ue-get en h en g:nan>, nod 
AdoMel tnethyi groop n^uet be oreeikly main- 
tai.n.etk 

'Ihe two proline residue* ere no; present m ah 
eKamples otmotd lv: Mmm i$mm & DFQY) and 
y,SM-a (grot^p s: D-FFV) r^emhk the sKNA 
N6mA kkases in tim tmpm. Analvste of )2 rRN A 
N'SntA kitases (bC 2i kffi ^wa6 die eome?^ 
lN/S}iPtY/F) (X. Ch^ unpmUM 
vatteh- Akerh^ ne>ttr IV ofehb ba*tohw T4 
Dam NOxnA klt^e horn, DPPY to DAPY or DTPV 
(<b neerire in M.So;!-*} auhetandaliv increased 
Kr^k Inn i>ad vwtch umlkr effects on A, C; Kf fe ^ : ^ 
and (Kc^kh er ah. 1^), This congests diet 
the Pro alteration ahkets a catalytic but not a 
rate- ikn it big {ten iW-determkdng) step, conabtent 
with the aho^o inferenees {partteukeiy tf produot 
release is the retedtmkhig step, as k is tor /a least 
soo-^e Mtases; Rekb & M^booo. Ikifernn 
nateiy these motatinm runo not be-?o snade in 
MJksB&m bat in \M{ mumm. dmwm DPPY to 
DGPY DYPY DPGY DPRY DPQY iaa'-oeeurs in 
M Hfeilh DPEY or t)PVY eh abohabed aetivttv 
(Goyoi ei aA P8§3j In snnnnary these tesnits en? 



gkr k Ce;r-p.^isee. or ee; DMA oaenesyi coxi A<kAjet^-edeno^i blad ing sfe in group f 
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be^O evtanee NS <^ ^e^ ?o Gaiety in rtm$ Vi of te.HM. 



exigent with the rafeu^tsd for moist IV but 
afe> make dear the danger* of considerir^ the 

The lyr In motif IV Phe in mold' VIII, arid 
Mfopba^c'side-^hoi«j§ in -motif VI could fo&atoii 
in properly orfemiog the iar&et DNA adenine. Thy 
amtogy far m lytcPbe pair is the : Pfa« from moof 
V (group or motif I igt&um %md $ thai malm 
m edgwtofaee v$w wm% with tk* AdoMst 
adenine nbble 1], end to the ■ Trp41. rT motif h in 
MJIM* whkh itiskm & fee -to-fe? v# mnfM 
with the AdoMet afehe Thb ^rp^miion is 
with to ikt that ai^rir^ motif IV uf the 
MJteaKi N0raA Mtase NPPF to NPPG or 
NPPC ahoiiabed activity whhoot greatly rfetb^ 
the af&nty tor DNA or .AdoMer {Wllicock et a£ 
W4*i Jte $mtrm< altering sh& NPPF to NFPY or 
NPPW (as .occurs? hr Mw%;T nm^ed m an ensvme 
retained part Sal aebvkv (Wdicoek a a/.. lilMh 



From the ffcst mpurt of <m N4mC Mi^ 
^uengs. it has fc&i auggesied, based o*v overall 
m^rm timftstiiy &od the very similar c Mmlc^.l 
prppgtfte of adoriilne No and eytosine NT k&. see 
¥\gim SMmimt eisL ISM: Renugopataknshnan 
et a/.. n?Ti} ? that N4;oC and NSroA Mtass :nev use 
« etymon wmitim ^te^m (lao.^ $L m§S, We 
befeve that cytokine and N*Vadeotoe med;yh 
Mm do im the ^me catalytic msdt&nim -far the 
Mkm'mg remom. Fml. m rfeerihed ahave, the 



N&uA md'mmC Mmm iji group p appear so be 
mm: d<mly mlmd to am a^her to elte 
subgroup is io. the N$s*A Mtaaes o:f group y, 
Seeorai the NfeA arid .N4mG ^sfons of 9oU& are 
eijhferii^feh^iifei^fe-fnsm one arMha\ or- are not 
commmy diffeeot from one another (the excep- 
tion Are-usuallypmvJcfed by MM^I or M^aaiHl. 
Fo? eKarapfe : the most obvkxis diffe^:j:«:^ batw^rt 
•k? N4mC and NBav\ Mtase sequence, b the 
coBserv^d S§r p^iaan; br xm$M IV sh p\$&$ of 
A^p/Asn: vat this Sor rnost tx* repr«^m ^ 
^ntM mncto^! difeac^. m li & not t>tk%m in 
the N4mC M^MtatHl. m ^ug^t that the Ser 
in motif IV of m^i N4oC Mt^ could hydrog^ 
b^nd m& mwm tba aadm N4 mtrogen of the 
evtosin*? stag, to arjalogv to COMtasa, m which a Ser 
hvdr^a4:«^ ia the AdaMat (Vldaren 
^ a/, 199^. and in analoav ta tha -Ast> in motif ill 
Df du: M&nA. Mtaaea fialao 11 Theaa Mraaas may 
<aao ba bydragan bond doria?a tp ayto^jna N3. 
aimibir to *M p:^>osed boadtng of NfeA Mtasss 
to adarao-? Nl ; but lha do^r skia^ehata would bo 
!x<aa out^kfe moilff¥ foe^ibiv a ««Md Aan ia 
mm W m^mnm for N4 mmmm of 
ayioaioe mf ba Implied by tha f^:i dM the 
iadlvkfeal raWfe m M.M^L whtk nnan*bb>uoa'dy 
\n tha poop k: order, mora cbsely raaaxobla tha 
cort'asoatKih^ m&m of tha o?:bor f^roup -8) H4mC 
adguroiy 



and 



li quid 'be that dHi trulls pf our snafets 
are wry oona^^ot with, and provide a atrocroaiii 
tes for, earner attempts to :%rtmp Mt^oa. Tho tet 
of thaaa adomm* ?^mimd' I? M:» v and waa 
b^ad net on rnndf Ide^Sc^ka^ and ordv>r bra ot^ 
ovw^3! aoqijoaoe align meat (Chmdmaapyan & 
Smith X§8§}< That at^lys'b found ftva ^rntjpa af 
Mt&sm< Their group I: iaolndad tonr M^es ad in 
<sur group «; tknf grmp M inautdad two Mt^a^ 
both la group p; thoir g?oat> HI indoded Mix 
Mmm> oil h) graop y; thair group IV ioekid^d six 
SmC'MT&ses (which w^ did rkt ir^lude, but which 
Haw s groo;p y xootif order, - except for fe poajtion 
of aaMifX).. arid their group V cor^iatad of M,B«1 
which baa Mv^ral varkdous torn the ammmm 
motifs, im which we aasiga to s>roup y h<mi$ an 
m^tf order. The $oeoad m*jor arady^b a^taaonmt 
33 H araino Mta^aa (Wiboa Mimf ^§^F 
Tboy pbced tf^e arna-io Mmm irao hve gr«p^. 
ba^ed on the order and ?>anne of io^t roonfe I and 

la conalatara: with, our a^igm^m except that (!) 
tb^y grouped too N4mC and Nca^A Mt&m 
and {2} tbey aa^occi M. S®&\<mmm\% 
TIk: N4mC and MfiroA Mm^ ware ml ^immzd 
m a later vemoa of tet. wi^is (WUmi 199^), and 
wa have adopted the ixm^ndatura of that latter 
anaiy§is; Four to ten Mt®m h orn poim y were eaao 
ak^tared by Lsu^er er &L (mi), jaouhatts at aa 
and Noyer-w-feidaer ar a/. baaed on 

ovvraH seqoen.ee $im.aarity More reread v Tbaira 
al^erat a/. 0995). najr^g a skmiw a>eibo^l to ; 
pairwwa comparisons beiwoen ?antno 
rkaaefed two eoaserved moife m mklMm. to the 
two that hsd heart kfeotided feee K iimasau3ka$ 
^ a/. t989). Their a-x^erved motif (CM) h eor- 
ra^porafe to a sotif X. CM 1 to n-ohf I CM ft to aiotif 
IV and CM 10 to mohfe V end VI LJ^o^ theac, tbey 
have defmed eight groupings for tha ftltases which 
are conabtera: wtth otirx and w <i b Wtbon & Monw 

their gnmpimg^ to separata bHraC and N6tnA 
Mtaae^:. Star a our analysn. with ;;iaai^n^ier«: 

of motifi I zM-iV (Mmkm zM Methods) { k is not 
surprising that we got our grouping consistent with- 
Wifeon & Murray (tS9F. bra thtt rbsaovery of the 
save® addkksmi aooserved moUfe m mmimmil 
^dara gra>;a!y strengths the hmfa for this 
caiegoriyadon. 



The four DMA. Mumt aniingemomx seen to date 
^ P. V. and StnC) differ in' tbe linear order of 
eonaemKi nmih (Figure 2), hot an tm are 
either of the two oo p duatara int&nrup^d, Ftrdher - 
mmx conaJder the xup^rh^)ossibra §tnK:te^ nf 
thoae dusters, which aajoear to h®& the mno litmr 
order in all toiar gtoun^ fBJ/p4 ■->■ a:A/^D -e |B2/ 
1^5 ^ otB/^EyBgt^ SK and tha apparatii fo ncmml 
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fd«d:m*s <;f the AdoMet and adenine Uwiim 
mm (m* \). Ml of This Is mimm mh the 
pQ&ribtttty that the zrom after psm 

dirpheadsn converted m A^Mm-midlm Ptk^m 
km a p^jn that bound two ototeailes of Adohfe 
im Umm, 1988, lev a/;-. 1989: 

Guyot a Cm*d*6n, 1994). The two b&m cmki 
mm $mtg$4 \vbm\ mhm: the amino ^pfoxhna! or 
earba^ primal A#NM^indln§ site ewtarf to 
mm Mtemm. and then ^verged'.ftate: to yiek! 
OQtVftase (Figure 1} md the wid^e vmm of other 
A«^dep»m :MW 13802: Clarke. 

1883). lb become DNA Mtases. bv this dU$fc&R>n 
flwfel m Mmimdlmkm would tenkm die 
t^<%nMm t^m m at the e&rhoxy mmkm 
igrmp yj or between the AdoMel aocS adoome 
bindhig ate: (groups a mi. P) of the ancestral 
zdmkiz Mtm, m$ &n>hsMm mwM ixm provide 
height utotto-fa* ttal&tmg MteSi? appMr 1:q bind 
two moi^aite of AdoMet ffe^m & &f^ha*ie& 
m)Q: Arbms & BkiRfenth^ ( 1095), one of whkh 
Mkm the selectivity between su^rais m<l 
mtv-^wMc DNA sequeocea (for MMoDmm 
Bergenk & ©uaehfbeoer. imt Im&teo mMwori^, 
with nssgasi m the proposed impr^mm oi a tercet 
tm^mm region, that $mm SmC A^mee ere 
natureUy made &s swo M$mM& pdvmiMom: one 
hae mom I to VIH (mdoding both ot/f dusters} 
and the other earms the t^eg^e-reeogtji^ri;^ region 
end moiife l&aad X, and these mmkm tn'the eeli 
io farm aefive enxvme {Karreroen & de Waord. :1 3"§0r 
Lfts-araL IMS) v 

While rho S&sC and N^nC Mtases eoeh fall' into 
single groups defined bv monf mkv CwHh one 
mmpmx; MMvzi: NfeeMnw aligned to Rtoup 
$1. the Mem A Mtas&s- wo sx^te*! ere feidy evenly 
distributed among the tyycm groups (27% in group 

fend j«: im 38% each Ui 0mp$ and y}! Th:^ 
may aids? he e^platnod by a' -nodei in which the 
origlmi nociek: add Mtasefe) NemA. 

hi ?5U5:nmi^\< the DNA Mteaes appeor to be, 
p^do?a}ca% both nK?re unlferm (vhered eooserv^d 
n:i!>ii:&; N4mC Mimm m a dMmet group}' and 
more diverse {four oo^dok motif orders) than had 
h^ee expected. The solved strnemrea of Mn^ea 
ten gmupa and S should be very mh^nanve, 

Materials and fctliods 



T}::.e :>vvmpm; $m*>m>. ^ smzchm bv EC 

me m$m xkS mnencea ot 33 NemA DNA 
Ms*** m 2Xm *m nine HfeC DNA y>^e. (EC 
U\m- The »5 and ^xession mmhm of the^e 
M;sm at<? t^te4 in Fl^u^e I, ror eonm^ri^i. we are} 
osed \hx m^mm at the 5m€ Mte^ {eC 
MimUm) the mmlhmmzcm <Xmm W: 2 hX3h 

A sran of the<>^uenee$ wa^ favt aermrewd e> tems 
tnetit I and neint -V Thmz tvm hlocke proveieri ttje 
eechor poMadk giobai ai^emis: of ate meiife. Aa 
eomp^rahie mnn&'dld e.ot appear hi rkx $mw 

linear order, the ae^ee^raa were wKhia eaeh 

emup. Dor enable ;>f M,S:e}tl yimmi ir&m^m 

te>r clamy and cnnvenianee ; we rsftal» As < 



H d i\m tor the .5mG Miaseannserved modA 
and of Wite 0892) tor she Mx$w eroeps l'h« 

imz$ym : . m It. which x.o^if oesmom wer« 




v^v thank Carheone lo$eh snaky Dranrtan: (t;hiv^ii v of 
cdt-a! diae^&ar of the W^l. 
,aa Waear ^.eO Davki t. £ Dmte 
of Bdiotegh) f& dimmiim of the mmp 
Cm4 Sahmckahjer and Wm<m Saon^er 
(Frele Uoivaasast} for the eeoroioates of a^sd 
Kae-in M Ra;naa:h ;md Wlhiam hi lhae:omh (tiaevard; 
Unlveraii v) for ttie rnc^dix:ates of htHmm, Vsk aiao -mmk 
Aahok S. Bhagwat are;!: |i?an C. Dorter ivvay^ Sta^e 
Urevii^ayi. David 7. F. Dmka (Dejva^ity of bdav 
hixt-0)}, Stalky Haitemn fynM^xty of Rodmt&i 
R>::wena C Maithewx aaO }amw t }mn>U aJreversev 
Mkhk^n;, and Richard l Roberts. Sanjee Kumar, ianea 
fejae and Geoftray Wj^oo (Newoneiand hlehere tor 
heipka and -,#mmm ovAthe 

work auonomxibv *he W M. Ke>-a ! 
(to XXA, byg^a GM40^§ trom = 
of Heakh §&'&Cii ^ by gr 
FouodaOoo 



NaUoa.1 



ndana, G. A Btemeotha], R. <vL dM&), What ia the 
a;enihcsra:e of the kaerran daplk^ajn e$ th> PviM 
DNA memvkransfera^? e45^ I. ^ Aim 

\ A. & Chaa^lh;Ra>r, Vv {Ha?0}. The dnohk: nde of 
donor aoo aH^a^re? efiector of S adeooavji, 
: hit Dam methyl of E r^. tv^e A^A 




he^e:e, T IL I.audaeo. A,. Maeaherjo, K. & hmraav \( 
d^) Closing and $eeoenet^ of a eDMA m&m 
DNA mMtbste of mome mlh. I Ma. 

Omm, hi i»th Ribbons Ah, t %A M r 

mm. 

Cbm^tm^mn, S, i Smith, H. 0, Aeene aeid 

seqneoo? homcjlefp amorig iweoiy tive jt^'r'ktUJn 
aodoooi:lea$es eod fneti)y!aae:a hi Sirmku* A 
Bare^oam Aon- FmMm 6 t^atxo;?. It. D 

avSarrna, M. H., eda} : vol. A p?x m~m AxDmne 

A. Ah^Ahj;an. A, M., Ch^g, Av E^Nfeav: 
• nee W: a. & Vafdlee, A (3§SJ>; Diraei 
iiton of the ^nveaite ooA^oooiie m a DNA 
AYir>^eiea}rnethvH:omAe^>e. aax^mAnr. W. 

Che,e ; L, MmM&Mh A M: ^ Aaaime. A. L O^Bh 
Mj.aatii)nai iwpmtim of DNA htridiog h^arr catsiyais 



K, Line, 



in a DN A eytoAne 
See. m. m$Ml* 

■vm, cm, om. m?m, im ih 4 -ha 

X. flSMb). Stratum end ivnehoa: of DNA 
o^d>y;!translera^£?a. Arum, Bm&m, .mzmi. 

Struct 24, 

. K, Kumar. S, A^lai A PAagratie j. W & Rebate, 
A993ai.Crva!.el seoaaav otth.? . 



ajropkxod wEh &®kmwH^mhkx£x^ (Mi 74, 
Ch*:%,$L Human 3,. ki&tts^ta* $ & imm% & I 

amm* m*m\ mm^m of m$ mat dna 

v^thy^<^s&i«^:. OM mm Mwkv Symp. Qimt 

Gbmg, X.J^edk^i k, $&mKm& ;. & Anderson, j. E. 

DNA, Pk#0 /. 13, ^rkkeek 
Clarke, S. Prxmo ,«*e^y:ke1oo. Coo\ €fe (aH 

Gato&t*, D £.. &&hsSb*& A. M. Us, Urx<M\ U. & 
mknwK A G S, R. {1S94}. otroaue-e of 

iiedve eoeikem^:* of Dili! arid tMmz&MJwzx a? 

Dry**, D. X k, ; SieiroxN S. S & Wm&r, M, mm 
mmtxrS moMti m of e * yt >e I DNA. .«Hs&vfem$f» 
Ate? Struct. M l k3£- 

Phxoegao, k, j. & DmvM, E, S; » Seri#*i wri 
ifktm&mim by $$3t$*t$ji homok>gy of a por^fve 

M$&mfi->& & Aoassri, N. a$g&. Bkd^ stf'thft JEfcsiUi 

DNA. isokdoe of ,?, bound hrotoik No<A A<&fH&£ 
2». 3X41*324?-. 
Fwjk&s, M, .{t&& Mammte small ma&scaite i&gttoyl- 
fr-sM^isfc*' their stni<ftsr& mn\ Nmetionel fmurtis. 
Ik: I Biomem. Zl 'm7~mi 

c^-a. i a (mm., ^m^ntm^-m to smdv 

QMmhrM, P.. Kim. 1 & iPido;ekk. D. P £!3^k®^«& 
kafcste> and dxar^or-^tk;* of the human T-cA 
DNA-eyk^o S-o^:hv1^^nsf-ra^e ^eoe J CM 

G:J«]. S. & Gaodroejl mHl k^ttdeei ^poheeoce 
of mim. m<i m**&m$ ^mtem to Woe It DNA 
«K&*rYfcm^ra*^ CR AeM Sci BL Sk Vk. 317. 

(j 093J. The n>!e ^ the p^^d s^us^ :>f 



r^Ow^m Acids 'm.. TL 



*im:iiK arid n-^dxsi^mf-?- o^l^ro^^ia^j;^ of Dcm 
wnthefu: o!^;or : ack ; c^d^ :l Ouoro ^' 

■jvokakJ^xY tjr^:Ui :-?r^i i^usge T4 d<sm hc^i 

HO, O: Kv/Wtiv J. C ,. D V. & p.te. R G. (lW!5v 
St«r«oi:h^Gal ^).«d.^ -«af the C-^tiwI^ion of 
■feosytyikBv^?: e^aiy^i by HM ftwihyi'ak a?xi the 

?^:dwi^, An:k ak^fc^ i^m. 2M< 2M~m. 
Ingtmm, D,, F<>wk>v, A. V.. Sfelb^m. ]. & Ckrko. 
0^), of the n*8s0t$y\A -im&mmyi 

mmnvn M^m\^ mmh tor pmm-, dKia. rna, 

4m mmrnm^zm. i Bid Qmi zu, mm- 

26138; 

}ano.ko^s ? A,. V^v*k, li, Tkok^k^, A. Klm^i^k^ S. 
% Btkkus, v; £109-?}- Clm%$ mxi m^mm > 
of >l:o g^es eodi^ for Bw$7t iype iV s 
modific^iOft:^v«^ . ^«d. A<^.te, 20. i 




ihh::rii^-dep^odeoi methyiir^fe^e 
i;«ji^?noj> mn.ic:0.tf^ -Ibr t^ese oe-- vr$>^. Arch, 

.a & * a. a^i) a^ 

Km, K., H . Cte. R, Vawk. H, U;m, H, 

ms M^sM : cle^es GCATG (N) 10/14 3^ Nod. 

KHo^i^o^k^^ S, ;no^!^k^. A.. Meo^vicu^ 5., 
Bute&, D.. Butkus-. V 4 l^ukiim, A. 

s mofB dwaaermk cf DNA (Qta^tse N-1) 
s&irm&, -m^mv to <:dorooe sod cy. 





v X Nete J, L & Robom. P.. f fl^lK/ftjft- 
^>oci5ekv dOfO^oi of evt^itt^CS ^«:>hv- 
l^, i^ad A^ R<x>. UK i>m4>i%. 
imirm^k^ Sk Kuo^r ; ;x. Rc^b^rOi. R. !. ,^ Cheog X. 

HmI n*o;.ovkr^o^:s::^e ko>s O.s Ooio??: hm 
mi of the DMA Mxk. dii. 3^-^ 
Ko^^ykh, V: C, Schk^ion. S. L. & Hmmm, S, (1093), 
C^on^rved ^eque^ mml IWFY k\ m0m JV of tht* 
ph^-^ T4 f)^ro DNAdNo ^teioe]- 
k: ioi^or^oi ox 5^dono$.vkl ro^:0ion 
Nod. Aa^ ife 21. 3^3- ^6^ 
Konw, S.. Cheng, X.. ^ro^;e-te, S.. Mf Pmtel h 
lir&wx, R. I ^. Wikon. G. f> fij^4}. The DMA 
{cvio^i^e ol o^hyk^osfi^^^e^. Nod. 
kid. 

Lah;^hn. k Gr^o^o. j.. Sohhx^-hk^ G.. kooh^O:) D Fk 
j^k. W k.. ochiklkriHO: I & Beeogeo Vk (iflM). 
Thj^e dkoev^^oea! ^nroc:ts.ue o4 ihv: - ^{jeedk> 
DNA ^ethykrio-s^eei^e to i:00i!>^^ with d-o 

coiktor S a^ieoe^yhTiodd^ih^. )Aec. &d 

lMsm&*. D. V. S^kbl^i, :k & kevke M G99;k. m*tm 

threogfi difiereni $tohllizk5g e:i0^eiio?^ 'Pm&r 

i.mm?, k. (P?M), Dopteooo and v^rUkkm ai o 
phv!os>eootk: piloeksk: of i voe P DMA- imhvitfmv 

l&xsm, R. (l^kk Evohfeo of mm li DNA ?xioihvkr^o?>- 
ier^: : : » dop^eadoo model j. ^A Mm. m. 

mm. 

Uixs&r, P.. Kmteia A, N Gw^m^r, W (k^7k 
"fki? C:ATArCoiX)d^ : k:<:don er^yjoe IkoPV h dimfy 
■mkiX®& to tke G ArC>rocogo^!i^ tnethyl^smfem^ 
Spnlt Mm dkm k oro £; &$ mxl plmm Ti F£m 
iMtCTS, \#i De. 

Uu^tei Ik. 7<<u)toer. I A, & Nover-Vvkido<:: . ki GP^). 
Oytd^oe «p«e-ififc iy&t U DN A tummvxz^ A 

(,ee : K^iC ■K,:-Ki; N h®v/> P C:.' (MS) A ImmM 
xmimxv^miiiVim M£m\WMl x-^km i:wo oro- 
tisns" for h vitm o^hyk^oe Nud. Axm fe. 23. 

ikk l a sand, d. v; mm. mmm ^ ^ 

iiwoiidyk^o wothi^e wmttmm io., but: hi; « 
^odalfok ^^{y^, Pox:, jy^d A<M. Sd USA, Bh 




Ml S. * (s^heto, P. j. (l§9cd hkiw M Mspi md U HpzU 
deekk? svhieh b^o to Jmlwte, Nucl Adm Hm< m, 

miumm. 
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fcfc*. &. tv&m& R. i ami The dna :fftedb$ feuy 

of:*! »-&»s:hy$a$sg H Increase by ■» sksgk amino add 
suhstmal&n -in- the .£Sta3v^c^ak mi.AtnhMw. 21 , 

reductase frs>m J8&^k$m-&& & IM A ri^oki-ion- 
compii^ vv:uh the sr&yxtie from Imnrnn ervthro* 

Nwr-tf&dne*. M fc^MSser, X. A, (1g§& Msthviak*: 
of DMA ix: :>cok;>ryoo->. m HA a M^vM^a 
Mtimmv Bbmvmd mm$!e& S!&f8mc* Qm . I k 
So Safe II C edC\ WSMSl SU^ser \%3bg; 

Noyoo-VVddooo Mr> mk;ov.A Tersd*ur*». P. Cto A & 
Ih^tnsr/T A. aSM, R8o:rik a.a«^;m«m^eci&: 
DN*A fcytia^-CS) ^^Jtr^sfi^m with pro- 
ooime^d ionmo smmhzs idoA&ntv Ao & kaodiv 
of >?<k?ninc ? -N;§-DNA -R^hvlt?an8fef3^ NkX Asttr 
Sb. m .-408M871 

O'Gsrs; hU kkAloy K, X & Gii** X. 899$. 

structured****** w«« iilim^^ of 
Adoy^i d^^Jid^it j^ih^t^fe^^ &&«o 157, 
tSSAM, 

Xo£«k)Of A. L : Xhti s h.^iAm^mm t K A $«klX V 
0*$$.. On the swhamsm of ONA^nmo me*kv> 

Vashi.U^^l A. S:/ Pos&k 0. & Rnkom, R. j. WMl 

fe&$es, Nud. Afkt Rm. 17. 
ftwifei G,, po, A C. S*tfak L, Komy A A X^^eti^r^r, 
P. HWih ^mi>k?H^ni:stk>n fay deiadaai p«m of 
GGCXAspedfk. Dsn A. ^wfe^^. NmL Adds 
Rm IS WA-4P7. 
»,¥a¥ M^^o. r N. U^oy pr^^dy sfcas 
klndtes of m S-iiik^^imv&i^ tvy 
zvjm J Bid, Chm. m^mi -rM. 
.NAX Mm&y: K. A , Skoon;ak*:f, D. 0. % fei, 

■Ml ~£§'4& 

K. U> : Qmh LAWdk*. G. L & Up&c$mfc 
W ok (dd:A Tko iiymx ^xxmxirv of H^lli 



o'^U>y!i:f>)x^fem^ ixiv^fetiy i:o:::op!o^od to UNA: mi 



CvM, M, m m, 

;imk V. ^I^hm^^tmi, A. V A 
V. aWfk Sfeox^h^oistrv of no^kk: 

mm> D: % (>,mm: C, E. & B.;io\ E |. {imi On im 

ir<mSi^mm cwwm msdim (Ml $1. 9 1(1 
SmmAl M Nmmtt A. A Gfe^Ce, E, (mil: Covsfent 

kooo k^niak?o betwc^A a DMA cy^^ios: o^o>1- 

o\>^k:o^c ao<l DN A coo^iok>k S^^tmi^v Pm, 

Nm Ami. & CiSA. HI : «s4^T. 
Sdmdk G.,0^OT^ A. & (»«:.c&m^ 

<>t (htf ^h¥3t«i«s6^:. tto?Xi Ao:^to^ 

^djki^a^r. G. C>X;«r:^ ivi. ^^og^r. W & Cho^. X, 
(1^^). U^v^o mmm 4omm mmcmm or 



AcfeMei^pe^fe^ maiwk^os&r^* i M<J tiki. 
247. 16^20, 

Sthr^ud^ R A, M^^vl A„ QMi®\m%, d. KM. K. R. 
HAc W G, von ikr Boit, k k & ysa Bemd. W. ]. 
(1<^. Cry^a! #mmm of w»type k hydros 

fem>» pt)^ ^xtip»d With 4^: f Oi.0iv 

kett^k^lA ^od ;^hvt'imw--4^ 

amoxjb^^om >m of m 'mmSAh mtim. mm- 

m s ptQtm thmnvi mi * w btodhig joock of ih^ 

Fjndlfjg ^qosnco motlfe U> ^oups of feiCta^ljy 
misted p^kns- te. A^o Am, Sci USA, m, 

S. S., Ko^kox, 8, H, %wsm, h C, A Nowm*o. £ M. 
(;^k fe^^m of huum: rmthykdk^icd DMA 
lmi(fe!»fe» m& the fmiivv >d cviosiiio 
me#^:ion. /V^ ^i^ai Sfi. «8. 47&-AM. 
Soot. S, Bhsgxvst, A. S, & Frkidm&s. &im^ Nork^kio 
s^quooce ^ory^joo ®l ma mm vxmxlim iM 

Emm mmmmn mmm Nuci Acm m: u. 
$}^m. h, mm, k & Mmm&, u. i\mi xio> mi 

^ilcilm mQ6imm\m sy^m.. 0!, Pro&>nco of (wo 
doro;3ios ki /Mt nso;.kyj&se fe$.p?>?^bk fcs oiodifk 
catlosi .of -differ mh ^id<s. J. Bid 0*im 

Ik). X. mim > k Bmxxm.. K. f, Ck^o^nion. M. M A 
Bkor^<od:=5k R. M. im% 5ocii^ov>>. im:^rt:ol kom- 
ok?^ ssKi k^kdovki ^pn-:a>o of dio mm for a 
DNA-k:v:o*^ rv^-xoo^vto^odkn AfPvo Jt 
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